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BACKGROUND 



A wide variety of information systems may employ 
information servers. Information servers may be used 
5 to provide access to data stored on the persistent 
storage devices. A data center, for example, usually 
includes a set of information servers that provide 
access to data that is persistently stored on a set 
of disk drives in, the data center. 

10 

Typically, an information server services 
information access transactions that target data 
stored on persistent storage devices. Examples of 
information access transactions include SQL 
15 read/write/modify transactions. 

A typical information server includes an 
internal memory that may be used as a cache for data 
obtained from persistent storage. The caching of data 
20 in an internal memory of an information server 

usually improves response time of the information 
server when handling information access transactions 
for which data held in the cache. 

25 It is often desirable to reduce the power 

consumption of an information server. In a data 
center, for example, it may be desirable to the 
reduce power consumption of its information servers 
to reduce overall power consumption in the data 

30 center. In addition, it may be desirable to reduce 
the power consumption of the information servers to 
reduce heat in the data center environment. A 
reduction in heat in a data center may increase the 
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reliability of hardware in the data center and may 
enable more density in data center hardware and may 
reduce costs associated with over-provisioning. It 
may also be desirable to reduce the power consumption 
in a manner that avoids a severe negative impact on 
the overall response time of an information server 
when servicing information access transactions. 
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SUMMARY OF THE INVENTION 

An information server is disclosed with power- 
aware adaptation that enables power reduction while 
5 minimizing the performance impact of power reduction. 
An information server according to the present 
techniques includes a transaction prioritizer that 
determines which of a set of memory subsystems in the 
information server is to cache a set of data 

10 associated with each incoming information access 

transaction and further includes a power manager that 
performs a power adaptation in the information server 
in response to a set of ranks assigned to the memory 
subsystems. An association of priorities of the 

15 incoming information access transactions to 

appropriately ranked memory subsystems and the 
judicious selection of memory subsystems for power 
adaptation enhances the likelihood that higher 
priority cached data is not lost during power 

2 0 adaptation. 

Other features and advantages of the present 
invention will be apparent from the detailed 
description that follows. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention is described with respect 
to particular exemplary embodiments thereof and 
reference is accordingly made to the drawings in 
which: 

Figure 1 shows an information server according 
to the present teachings; 

Figure 2 shows a method for power-aware 
adaptation according to the present teachings; 

Figure 3 shows a data center that incorporates 
the present teachings. 
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DETAILED DESCRIPTION 

Figure 1 shows an information server 10 0 
according to the present teachings. The information 
5 server 100 enables access to data that is stored in a 
set of persistent storage devices 30-34. The 
information server 100 includes a main memory 40, a 
set of information access code 50, and a power 
manager 20. 

10 

The information access code 50 obtains 
information access transactions via a communication 
path 32. The information access code 50 performs 
read/write accesses to the persistent storage devices 

15 30-34 as needed to service the received information 
access transactions . A received information access 
transaction may specify a read, write, modify, etc., 
of data that is stored on the persistent storage 
devices 30-34. An information access transaction may 

20 take the form of an SQL transaction. 

The information access code 50 uses the main 
memory 4 0 as a cache for data stored in the 
persistent storage devices 30-34. The caching of data 
2 5 in the main memory 4 0 enhances speed with which the 
information server 100 may respond to an information 
access transaction when the data targeted by the 
information access transaction is held in the main 
memory 40 . 

30 

The main memory 4 0 is subdivided into a set of 
memory subsystems 10-16. The power status of each of 
the memory subsystems 10-16 is independently 
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controllable by the power manager 20. For example, 
the power manager 2 0 may independently switch on/off 
each of the memory subsystems 10-16 or place each of 
the memory subsystems 10-16 in power reduction mode 
5 or remove each of the memory subsystems 10-16 from a 
power reduction mode. In one embodiment, the main 
memory 4 0 is comprised of random access memories that 
are arranged into banks wherein the power state of 
each bank is individually controllable. 

10 

The information access code 50 includes a 
transaction prioritizer 52 that examines each 
information access transaction received via the 
communication path 32. The transaction prioritizer 52 

15 assigns a priority to each information access 

transaction. The priority assigned to an information 
access transaction determines which of the memory 
subsystems 10-16 of the main memory 4 0 is to be used 
to cache data associated with the information access 

20 transaction. The priority may be based on a service- 
level agreement between the provider of the 
information server 100 and the client that originates 
the ' information access transaction. 

25 In addition, each of the memory subsystems 10-16 

is assigned a rank for use in power adaptation in the 
information server 100. The memory subsystems 10-16 
may be ranked in any manner. For example, if there 
are N of the memory subsystems 10-16 then the memory 

3 0 subsystem 10 may be assigned a rank=l and the memory 
subsystem 12 a rank=2, etc., or visa versa. Any 
numbering system or rank indicators may be used. More 
than one of the memory subsystems 10-16 may be 
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assigned the same rank and there may be any number of 
ranks assigned. 

The power manager 2 0 monitors the power 
5 consumption of the information server 100 and/or 

environmental and/or other conditions associated with 
the information server 10 0 and performs power 
adaptation when appropriate. In one embodiment, the 
power adaptations by the power manager 2 0 are 
10 triggered automatically - for example through 

heuristics programmed into the power manager 20. 

For example, an excessive amount of power 
consumption of the information server 100 or 
15 excessive heat in the environment of the information 
server 100 may cause the power manager 2 0 to perform 
power adaptation by switching off one or more of the 
memory subsystems 10-16 or by placing one or more of 

td 

the memory subsystems 10-16 in a reduced power state. 

2 0 The power manager 2 0 may implement any method of 

tradeoff between power and performance when selecting 
a power adaptation mode for the subsystems 10-16. For 
example, a reduced power state may provide less power 
savings than a power off state but still provide the 
25 performance benefits of caching. 

In another example, if the load of information 
access transactions received via the communication 
path 32 is relatively high then the power manager 20 

3 0 may perform power adaptation by switching on one or 

more of the memory subsystems 10-16 that are in a 
power off state. Similarly, if the load of received 
information access transactions is relatively high 
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then the power manager 20 may perform power 
adaptation by removing the power reduction state of 
one or more of the memory subsystems 10-16 that are 
in a reduced power state. The power manager 2 0 or 
5 some other element in the information server 100 may 
implement mechanisms for measuring response time.to 
information access transactions so that an increase 
in response time may trigger power adaptation. 

10 The above provide a few examples of conditions 

that my trigger power adaptation. A variety of 
conditions may cause the power manager 2 0 to trigger 
power adaptation/ 

15 In addit ion, the power adaptations in the 

information server 100 may be triggered manually - 
for example through the intervention of a system 
administrator. For example, the power manager 2 0 may 
generate one or more web pages that enable manual 

20 power control using web protocols via the 
communication path 32. 

The power manager 20 selects the memory 
subsystems 10-16 to be powered down or to be placed 

25 in a power reduction state on the basis of their 
assigned rank. For example, the power manager 20 
initially powers down the memory subsystem 10-16 
having the lowest rank that is currently in a full 
power state and then powers down the memory subsystem 

30 10-16 having the next lowest rank that is currently 
in a full power state, etc., as needed to accomplish 
the appropriate power adaptation. 
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In addition, the power manager 20 selects the 
memory subsystems 10-16 that are to be restored to a 
full power state' on the basis of their assigned rank. 
For example, the power manager 20 initially restores 
5 to full power the memory subsystem 10-16 having the 
highest rank that is currently in an off state or a 
reduced power state and then powers up the memory 
subsystem 10-16 having the next highest rank that is 
currently in an off or reduced power state, etc., as 
10 needed to accomplish the appropriate power 
adaptation. 

The power manager 2 0 may notify the information 
access code 50 of upcoming changes in the power 
15 status of the memory subsystems 10-16 so that the 

corresponding cached data may be handled accordingly. 
For example, any Mirty" data in the memory 
subsystems 10-16 may be written back to persistent 
storage . 

20 

The information access code 50 selects one of 
the active memory subsystems 10-16 to cache data for 
a received information access transaction based on 
the priority assigned to the received information 

25 access transaction by the transaction prioritizer 52 
and the ranks of the memory subsystems 10-16. The 
information access code 50 selects one of the active 
memory subsystems 10-16 for caching data for an 
information access transaction by matching a priority 

30 of the information access transaction to the ranks of 
the memory subsystems 10-16. The memory subsystems 
10-16 having a high rank are selected for the 
information access transactions having a high 
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priority and the memory subsystems 10-16 having a low 
rank are selected for the information access 
transactions assigned a low priority. 

5 The priorities assigned to the information 

access transactions may employ a system similar to 
the ranking of the memory subsystems 10-16. For 
example, if the memory subsystems 10-16 are ranked 
from 1 to N then a received information access 

10 transaction may be assigned a priority between 1 and 
N by the transaction prioritizer 52. In such an 
embodiment, an information access transaction having 
a priority=l will be cached by the memory subsystem 
10-16 having a rank=l and an information access 

15 transaction having a priority=2 will be cached by the 
memory subsystem 10-16 having a rank=2, etc. 
Alternatively, any type of mapping between ranks of 
memory subsystems 10-16 and priorities of information 
access transactions may be used. 

20 

If a matching low ranking memory subsystem 10-16 
is not active when a low priority information access 
transaction is received then the information access 
code 50 selects the lowest ranking active memory 

25 subsystem 10-16. In the example 1-N ranking and 

priorities, when the memory subsystem 10-16 having a 
rank=l is not active an information access 
transaction having a priority=l will be cached by the . 
memory subsystem 10-16 having a rank=2 if it is 

30 active or by the memory subsystem 10-16 having a 
rank=3 if it is active, etc. 

The priorities assigned to the incoming 
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information access transactions may be derived using 
any method. The priority of an incoming information 
access transaction may be included in the information 
access transaction. The priority of an incoming 
information access transaction may be derived from 
information contained in the information access 
transaction. 

For example, clients associated with an 
information access transaction may pay more money in 
exchange for a higher priority on their transactions. 
The priority may be derived from an identity of an 
originator of the information access transaction. An 
originator of an information access transaction may 
be identified in any manner - for example using an IP 
address. 

In another example, the transaction prioritizer 
52 may analyze and compute statistics on information 
access transactions and assign priorities 
accordingly. 

In another example, the priority of an 
information access transaction may be based on the 
data targeted by the transaction so that some data in 
the persistent storage devices 3 0-34 is deemed higher 
priority than other data. 

The present techniques may increase the 
likelihood that data for high priority information 
access transactions will be cached in active memory 
subsystems because the memory subsystems that handle 
lower priority transactions are powered down first. 
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This minimizes the performance degradation that might 
otherwise occur if the memory subsystems 10-16 were 
to be powered down without regard to their rank, i.e. 
the priority of information access transactions whose 
5 data they cache. 

Figure 2 shows a method for power -aware 

-i 

adaptation according to the present teachings. At 
step 200, a rank is assigned to each of the memory 
10 subsystems 10-16. The following focuses on an example 
embodiment in which the memory subsystems 10-16 
include a set of 4 nodes which are assigned the ranks 
1 through 4, respectively, at step 200. 

fa 

15 At decision step 202, if a power reduction type 

of power adaptation is triggered then step 204 is 
performed and if a removal of power reduction type is 
triggered then step 2 06 is performed. Power reduction 
may be triggered by an excessive power consumption in 

20 the information server 100 or excessive heat in the 
environment of the information server 100 or by a 
combination of these factors. Removal of power 
reduction may be triggered by a slow response time to 
information access transactions by the information 

25 . server 100 or an increase in memory bandwidth 

contention or a reduction in environment heat or a 
combination of factors. 

At step 204, the lowest ranking active memory 
30 subsystem 10-16 is adapted for reduced power 

consumption. The selected memory subsystem 10-16 may 
be adapted for reduced power consumption by powering 
it down, i.e. switching it off, or by using other 
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methods of power control . 

For example, if the memory subsystems 10-16 are 
all active then the memory subsystem 10 may be 
5 powered down at step 204. This results in the loss of 
cached data for the lowest priority information 
access transactions which is normally held in the 
lowest ranking memory subsystem 10. At step 204, if 
the memory subsystems 12-16 only are active then the 
10 memory subsystem 12 may be powered down resulting in 
the loss of its relatively low priority cached data. 

At step 2 06, the highest ranking reduced-power, 
e.g. powered down, memory subsystem 10-16 is adapted 
15 to remove power reduction. A selected access node may 
be adapted to remove power reduction by powering it 
up, i.e. switching it on, or by using other methods 
of power control . 

20 For example, if the memory subsystems 10 and 12 

are inactive then the memory subsystem 12 may be 
powered up at step 206 because its rank is higher 
than the rank of the memory subsystem 10. This 
recreates the capacity to cache data in the memory 

25 subsystem 12. 

Figure 3 shows a data center 3 00 that 
incorporates the present teachings. The data center 
300 includes a set of storage devices 330-336, and a 
30 set of information servers 320-326 that provide 

access to data stored on the storage devices 320-326. 
The data center 3 00 includes a switching mechanism 
314 that enables access to all of the storage devices 
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330-336 by all of the information servers 320-326. 

The storage devices 33 0-33 6 provide large scale 
persistent storage of data for applications 
5 implemented in the data center 300. In a database 

application, for example, the storage devices 330-336 
provide a persistent store for database tables and 
records, etc. 

10 The information servers 320-326 obtain incoming 

information access transactions via an internal 
network 312. In a database application in the data 
center 300, for example, the information access 
transactions may be database reads, writes, queries, 

15 etc. The data center 300 may include a set of 

application servers and a set of web servers that 
generate the information access transactions in 
response to web client interactions via a network 
communication path to the data center 300. 

20 

The information servers 320-32 6 perform reads 
from and/or writes to the storage devices 330-336 via 
the switching mechanism 14 to access persistent data 
as needed when carrying out the information access 
25 transactions. Any one or more of the information 
servers 320-326 may perform the power adaptation 

■ 

methods disclosed above. The power adaptations in the 
information servers 320-326 may be triggered 
automatically or manually through the intervention of 
30 . a system administrator. 

The foregoing detailed description of the 
present invention is /provided for the purposes of 
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illustration and is not intended to be exhaustive 
to limit the invention to the precise embodiment 
disclosed. Accordingly, the scope of the present 
invention is defined by the appended claims. 
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